PyDigger - unearthing stuff about Python


NameVersionSummarydate
trl 0.20.0 Train transformer language models with reinforcement learning. 2025-07-29 04:10:06
trl-fpo 0.0.14 Train transformer language models with reinforcement learning. 2025-01-18 04:51:57
nemo-aligner 0.6.0 NeMo-Aligner - a toolkit for model alignment 2025-01-07 23:05:48
shtec-rlhf 1.0.5 shtec-rlhf: Safe Reinforcement Learning from Human Feedback 2024-06-24 05:55:07
hourdayweektotal
91227110313304353
Elapsed time: 3.38602s